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Abstract 

In the recent past, there has been a concerted effort to develop mathematical models for 
real-world networks and analyze various dynamics on these models. One particular problem 
of significant importance is to understand the effect of random edge lengths or costs on the 
geometry and flow transporting properties of the network. Two different regimes are of great 
interest, the weak disorder regime where optimality of a path is determined by the sum of edge 
weights on the path and the strong disorder regime where optimality of a path is determined 
by the maximal edge weight on the path. In the context of the stochastic mean-field model of 
distance, we provide the first mathematically tractable model of weak disorder and show that 
no transition occurs at finite temperature. Indeed, we show that for every finite temperature, 
the number of edges on the minimal weight path (i.e., the hopcount) is always Q{\ogn) 
and satisfies a central limit theorem with asymptotic means and variances of order O(logn), 
with limiting constants expressible in terms of the Malthusian rate of growth and the mean 
of the stable-age distribution of the associated continuous-time branching process. More 
precisely, we take independent and identically distributed edge weights with distribution E'^ 
for some parameter s > 0, where E is an exponential random variable with mean 1. Then, 
the asymptotic mean and variance of the central limit theorem for the hopcount arc s log n 
and log n respectively. We also find limiting distributional asymptotics for the value of 
the minimal weight path in terms of extreme value distributions and martingale limits of 
branching processes. 

Key words: Flows, random graphs, first passage percolation, hopcount, central limit theo- 
rem, weak disorder, continuous-time branching process, stable-age distribution theory, mean-field 
model of distance. Cox point processes. 
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1 Introduction 

The last few years have witnessed an explosion in empirical data collected on various real-world 
networks, including transportation networks like road and rail networks and data transmission 
networks such as the Internet. This has stimulated an intense inter-disciplinary effort to formulate 
various mathematical network models to understand their structure as well as the evolution of such 
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real-world networks. Rigorously analyzing properties of these models and deriving asymptotics 
as the size of the network becomes large is currently an active area of modern probability theory. 

In many contexts, these models are used to model transportation networks and understanding 
the flow carrying properties of these models is of paramount importance. Real-world networks 
are described not only by their graph structure, which give us information about valid links 
between vertices in the network, but also by their associated edge weights, representing cost or 
time required to traverse the edge. Similar questions form the core of one of the fundamental 
problems in interacting particle systems, namely first passage percolation. In brief, one starts 
with a finite network model /C„ (for example the [— n,n]^ box in the integer lattice Z^). Each 
edge e is given some random edge weight /g, usually assumed to be non-negative, independent 
and identically distributed (i.i.d.) across edges. We shall refer to le as the length or cost of 
the edge e. For any two vertices u,v & fCn, and a path P between the two vertices, the cost of 
the path f{P), is some function of the edge weights on the path (see the next section where we 
describe two natural regimes). The optimal path Popt(^^) v) between the two vertices is the path 
that minimizes this cost function amongst all possible such paths. Now fix two vertices in ICn, 
e.g., in the case of the two-dimensional integer lattice, the origin and the point (n,0). One is 
then interested in deriving properties of the optimal path between these two vertices, at least as 
the size of the network n — )• oo. 

In the modern applied context, two particular statistics of this optimal path are of importance: 

(a) f{Popt{u,v))- the actual cost of the optimal path. In many situations, this gives the cost of 
transporting a unit of flow between the two vertices. 

(b) H{Poptiu,v)): the number of edges in this path. This represents the actual amount of time 
that a message takes in getting between the two vertices. The mental picture one should have is 
that the network is transporting flow between various vertices via the optimal paths and delay, 
i.e., the amount of time that a message takes in getting between vertices is the number of edges 
or hops on the optimal path. Thus this quantity is often referred to as the hopcount. 



1.1 Weak and strong disorder 

When modeling random disordered systems, two cost regimes for the cost f{P) of a path P are 
of interest, the strong disorder and weak disorder regime. Throughout the discussion below we 
start with a connected network ICn on n vertices, with each edge assigned edge weight Ig. Fix 
two vertices denoted by 1 and 2 (say chosen uniformly at random amongst all vertices). We are 
interested in properties of the optimal path between these two vertices. Let P12 denote the set 
of all paths between the two vertices. 

Weak disorder regime: This is the conventional setup where for any path V £ V\2^ the cost 
of the path is 

U^AUP) = Y.^e. (1.1) 

eeP 

The optimal path, denoted by Pwk-dis, is deflned by 

-Pwk-dis = argmin/„k_dis(^')- (1-2) 

In our setup, the optimal path will always be unique. We are then interested in the cost and 
hopcount of this optimal path. 

Strong disorder regime: Here, for any path P G the cost of the path is given by 

/st-dis(-P) = max^e. (1.3) 

eeP 
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As before, the optimal path, denoted by Pst-disj is defined by 



St— dis — 



arg min/st-disli^)- 



(1.4) 



From a statistical physics viewpoint, one is interested in parameterizing the above problem via a 
real-valued parameter say /3, often called the "inverse temperature" of the system, such that as 
/3 — 7- oo, we get the strong disorder regime, while for finite values of (3, we have the weak disorder 
regime. One interesting way of parameterizing the above problem is to consider the original 
graph ICn with some edge random variables Wg and consider the model Gn{l3) where each edge 
is given weight le{/3) = exp(f3we). The /3 — t- oo regime then corresponds to the strong disorder 
regime with edge weights We, the /3 = regime corresponds to the graph distance regime (where 
each edge has fixed weight 1), while finite positive values of (3 are supposed to model the weak 
disorder regime and are meant to interpolate between the graph distance regime and the strong 
disorder regime. What is of paramount interest is to understand if and when a transition occurs, 
namely given some model /C„ of network on n vertices and edge distribution Wg ~ F, e.g. the 
uniform or exponential distribution, is there some finite value of /3 for which a transition occurs 
from the weak disorder regime to the strong disorder regime, where the graph begins to behave as 
in the strong disorder regime? What are the properties of the optimal paths in various regimes, 
and how does the hopcount scale as a function of /?, at least in the n — t- oo large network limit? 
Although a number of studies have been carried out at the simulation level (see e.g. [8] and the 
plethora of references therein) to understand such models of disorder in the context of various 
random graph models resulting in a fascination circle of conjectures, there has been no rigorous 
effort carried out to derive results in this context. 

Our goal is to formulate a solvable model in this context and to exhibit how such questions have 
deep connections to the stable-age distribution theory of continuous-time branching processes 
as formulated by Jagers and Nerman, see e.g. [17]. Without further ado let us dive into the 
formulation of the model in our context. 

1.2 Model formulation 

Let /C„ be the complete graph with vertex set [n] = {1, . . . ,n} and edge set En = {ij : i,j £ 
7^ j}. Each edge e is given weight le = {EeY for some fixed s > 0, where {Ee)eeE„ are i.i.d. 
exponential random variables with mean 1. The optimal path between two vertices is the path 
that minimizes the sum of weights on that path, as in the weak disorder regime. In the context 
of the above discussion of strong and weak disorder, s = corresponds to the graph distance, 
while s = oo corresponds to the strong disorder regime with edge weights Eg, the parameter 
(3 > above is equal to s and the random variable {we)e£En equals We = log (Ee), which has a 
Gumbel distribution. The advantage of this formulation is that it gives a rigorously analyzable 
model. The s = 1 regime is one of the most well-studied model in probabilistic combinatorial 
optimization (see e.g. [2, 3, 7, 11, 14, 19]) and often goes under the name of "stochastic mean-field 
model of distance". For a fixed s € M"*", we are interested in statistics of the optimal path, in 
particular, in the asymptotics for the weight and hopcount of the optimal path as n — )• oo. 

To state the results, we shall need to set up some constructs. Let {Yj}j>i be i.i.d. mean 1 
exponential random variables. Define the random variables Lj by the equation 




(1.5) 
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Let V be the above point process, i.e., 



P = (Li,L2,...). (1.6) 

While the parameter s plays an important role in our analysis, for the sake of simplicity, we shall 
omit it from the notation. The reader should keep in mind that all the important constructs that 
arise in the analysis and in description of limit results, such as the point process above, depend on 
this parameter. Now consider the continuous-time branching process (CTBP) where at time t = 
we start with one vertex (called the root or the original ancestor), each vertex v lives forever, and 
has an offspring distribution Vy ^ V as in (1.6) independent of every other vertex. Let {BPt)t>o 
denote the CTBP with the above offspring distribution. The general theory of branching processes 
(see e.g. [17]) implies that there exists a constant A = A(,s) called the Malthusian rate of growth 
which determines the rate of explosive growth of this model. In particular, if zt = |BPt| denotes 
the number of individuals born by time t, then there exists a strictly positive random variable 
W such that 

e-^'zt ^ W, (1.7) 
where denotes convergence almost surely. The constant A satisfies the equation 

oo 

^E(e"^^')=l. (1.8) 

1=1 

In this case, an explicit computation (see Lemma 3.1 below) implies that 

A = A(s) = ^(l + l/s)^ (1.9) 



Now let W'-^'' , W'-'^'' be i.i.d. with distribution W where W is as defined above in (1.7). Define the 
Cox process T^cox which, given W-^^ and W'-'^^ is a Poisson process on M with rate function given 
by 

^(x) = _VF'^'M^(''e2^^ X G M. (1.10) 
s 

Let H^^' denote the first point of the point process Vcox- 



1.3 Results 

We are now in a position to state our results. Recall that we started with the complete graph 
where each edge had distribution = E^, where {Ee)e&En ^'^^ i.i.d. exponential random variables 
having mean one. The first result identifies the limiting distribution for the minimal weight path 
while the second result below identifies the asymptotics for the number of edges on the minimal 
weight path. 

Theorem 1.1 (The weight of the shortest-vi^eight path) Let C = C{s) denote the cost of 
the optimal path between the two vertices 1 and 2. Then, 

n"C- i log 7^ A 2HW (1.11) 
A 

and 

2^(1) = \{G- log ly'^' - log VF'^' - log (l/s)) , (1.12) 
A 

where G is a standard Gumhel random variable independent ofW-^^ and W-^^ and W-^^ and W'-'^^ 
are two independent copies of the random variable W appearing in (1.7). 
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Theorem 1.2 (CLT for the hopcount) Let Hn = Hn{s) denote the number of edges on the 
optimal path between the two vertices 1 and 2, i.e., the hopcount. Then, as n ^ oo, 

Hn - slogn d 



\J log n 

where Z has a standard normal distribution. 



Z, (1.13) 



Remarks: (a) Our proof in fact shows that the convergence in Theorems 1.1 and 1.2 in fact 
occurs jointly namely 

( sr li Hn-s\ogn\ d,^„^^-^ 
nC--logn, — , — >(2^^',Z), (1.14) 



^ \J log 



n 



where the limiting random variables H'^^Z are independent. 

(b) Not much is known about the random variable W in (1.7). Indeed, the branching property 
can be used in order to show that it satisfies the relation 



W = ^&-^^mi, (1.15) 



where {Wi)i>\ is an i.i.d. sequence of random variables with the same distribution as W inde- 
pendent of {Li)i>i, and where Li is defined in (1.5). Using (1.15) and properties of functionals 
of Poisson processes one can show that the function (/>(«) = E(e~"^), defined for u G is the 
unique function satisfying the functional relationship 



exp / log 







— At^ 

ue 



dx , (j){0) = 1. (1.16) 



When s = 1, then one can see this way that W is an exponential random variable with rate 1, 
but for other values of s, we have no explicit form of W. 

(c) The distributional equivalence given by (1.12) is proved in Lemma 2.6 below. 



1.4 Discussion 



In this section, we discuss the relevance of our results and how they relate to existing literature 
as well as various conjectures from statistical physics. The standing assumption in this discussion 
is that optimal paths are uniquely defined. 



First vs. second order results. First order results (in our context showing for example that 
Hn/ slogn 1, where denotes convergence in probability) are much easier to prove than 
the detailed convergence in distribution proved in Theorems 1.1 and 1.2. One of the reasons for 
the length of this paper is that proving second order distributional convergence results in these 
sorts of problems proves to be much more difficult. Further, while in previous studies (e.g. [6] 
for various random network models) the hopcount satisfied a central limit theorem (CLT) with 
matching means and variances. Theorem 1.2 is novel in the sense that it says that, for large n, 
the hopcount has an approximate normal distribution with mean slogn and variance s^logn. 
Theorems such as Theorem 1.1 for the actual cost of the minimal weight path have been proven 
in a number of contexts (see e.g. [6, 14, 19]), but often prove quite tricky to handle due to the 
fact that we only re-center the random variables and do not divide by a normalizing factor going 
to oo. Thus, one needs to be extremely careful in analyzing the contribution of various factors as 
n — )■ oo. See e.g. [6] to see the various factors that could contribute to the limiting distribution 
in the context of exponential weights on a random graph. 
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Strong disorder regime and minimal spanning trees. Under strong disorder, it is easy to 
check using any of the standard greedy algorithms for constructing minimal spanning trees that 
the number of edges in the optimal path between any two vertices in the network has the same 
distribution as the number of edges between the two vertices in the minimal spanning tree (with 
edge weights l^)- More precisely, the optimal path between two vertices in the strong disorder 
regime is identical to the path between the two vertices in the minimal spanning tree. 

In the context of our model, under strong disorder ( "the ,8 = 00 regime" ) what is known is that 
for the complete graph, the hopcount of the optimal path H(Pst-d±s) ~ 0p(n^/^). Here, for 
two sequences of random variables (X„)„>i and {Yn)n>i, we write Xn = 0p(y„) if X^/Yn and 
Yn/Xn are tight. This was first conjectured in [8] and recently proven in [1]. The above result 
in particular shows that no transition occurs for finite values of s. It might be interesting to 
analyze the above model when s = s,i is a function of n and see when the strong disorder regime 
emerges {sn — )• 00 regime) or the graph distance type behavior is preserved (sn — )• 0). In our 
proofs, we have kept formulas as explicit as possible in order to be able to use them later on to 
study the strong disorder case or the graph distance limit. Let us now heuristically discuss the 
strong disorder regime. 

Heuristics for strong disorder. We see that the hopcount obeys a CLT with asymptotic 
mean and variance equal to s log n and log n respectively. It is reasonable to expect that the 
CLT with asymptotic mean and variance equal to s„ log n and log n remains valid when s„ 
is not too large. However, when s„ is quite large, then we should be in a phase that is close 
to the minimal spanning tree, for which the hopcount scales like n^/^ and has variance of order 
n^/'^ (since it is not concentrated). It would be of great interest to see until what value of Sn the 
CLT with parameters s„logn and s^logn remains valid. By the above, we see that for this, Sn 
cannot grow faster than n^/^ for this to be true. In analogy to the scaling for the diameter of 
the Erdos-Renyi random graph with edge probability p = {l + £n)/Ti, which has size log (e^n) 
as long as £n S> n~^^^ [20], one may wonder whether the hopcount scales in leading order as 
Snlog(n/s^), as long as Sn ^ n^/^, and where Sn plays a similar role as l/sn- 

Other edge weights. Note that in our context, the distribution of edge weights is F{x) = 
1 — exp{—x^^^) ~ x^/'^ for X close to zero. One would expect that the results in the paper carry 
over rather easily to edge weights with distribution function F for which F{x) = x^/*(l + o(l)) 
when X 4, 0. When F{x) has entirely different behavior at x = 0, other properties might arise. 
Indeed, in our current setting, we see that with high probability the shortest- weight path traverses 
only through edges of weights of order n~^, which is the size of the minimum of n i.i.d. random 
variables with distribution E'^, where E is exponential with mean 1. Thus, the benefit of using 
edges of such small weight vastly outweigh the fact that the path thus become longer (i.e. has 
Op(logn) edges). Now, when F{x) = " for some a > 0, then the minimum of n such random 
variables is (logn)~^/"(l + Op(l)), so that the minimal weight edge in the complete graph equals 
2-V'J(iogn)-V«(i + Op(l)). Here, we write that Op(6„) to denote a random variable X„ which 
satisfies that X„/6.„ 0. Thus, when a > 1, we cannot expect the optimal path to have length 
0p(log7i), as already the immediate path between vertices 1 and 2 has smaller weight than any 
path of length log n. 

Moreover, it is not hard to see that the minimal two-step path between vertices 1 and 2 has 
weig ht 2i+V«'(logn)-V« 

(1 -|-Op(l)), so that the hopcount is with high probability at most 2 + 
Thus, this simple argument proves that the hopcount is tight for all a > (as is the case for 
the CM with infinite mean degrees [5]). It would be of interest to investigate the limiting law of 
the hopcount (does the hopcount converge in probability to a constant?). In any case, it is clear 
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that weights with distribution function F{x) = " belong to a different universality class as 
compared to edge weights E^, where E is an exponential random variable and s > 0. This leads 
us to the following general math program: 

Identify the universality classes for the weights in 
first passage percolation on the complete graph. 

Extensions of our results to random graphs. In the context of first passage percolation 
on random network models, a significant amount of work both at the non-rigorous ([8, 9, 13, 22] 
and the references therein) as well as at the rigorous level ([4, 5, 6, 10, 16]) has been devoted 
to understanding such questions. What is now generally expected is that in a wide variety of 
network models and general edge costs, under weak disorder the hopcount scales as 0(logn) 
and satisfies central limit theorems as in Theorem 1.2. We hope that the ideas in this paper 
can also be applied to first passage percolation problems on various random graphs, such as the 
configuration model (CM) with any given prescribed degree distribution {pk}k>o- In [6], first 
passage percolation with exponential weights was studied on the CM with finite mean degrees, 
and it is proved that similar results as on the complete graph hold in this case. Indeed, the 
hopcount satisfies a CLT with asymptotically equal mean and variance equal to A log n, where A 
is some parameter expressible in terms of the degree distribution. We expect that when putting 
exponential weights raised to the power s on the edges changes this behavior, and the means and 
variances will become different constants times logn. While the behavior in [6] is remarkably 
universal, we expect that for weights equal to powers of exponentials, when the variance of the 
degrees is infinite, the asymptotic ratio of mean and variance will be s as on the complete graph, 
while for finite variances degrees, the ratio may be different. 

We see that the behavior of first passage percolation on the complete graph with weights E^ (as 
studied in this paper) gives rise to CLTs for the hopcount with means and variances of order log n, 
while weights with distribution function F{x) = " give rise to bounded hopcounts, as is the 
case for the graph distance when all weights are equal to 1 . Extending this to random graphs, it is 
natural to conjecture that for random graphs and weights E^, the hopcount satisfies a CLT with 
asymptotic mean and variance proportional to log n, while for weights with distribution function 
F{x) = the hopcount behaves in a similar way as the graph distance as studied for the 

CM in [10, 15, 16]. This leads us to the following question: 

Are the universality classes of first passage percolation 
on the configuration model equal to those on the complete graph? 

1.5 Proof idea and overview of the paper 

For the sake of notational convenience, we shall rescale each edge length by a factor (n — 1)*, so 
that each edge has distribution (YeY where Yg are distributed as exponential random variables 
with mean n — 1. This does not change the optimal path while the cost of this path is scaled up 
by (n — 1)^. For the rest of the paper we shall think of the edge weights as lengths which thus 
induce a random metric on the complete graph and shall often refer to the optimal path between 
two vertices as the shortest path between them. We are interested in the optimal path between 
vertices 1 and 2. Consider water percolating through the network started simultaneously from 
two sources, vertices 1 and 2, at rate one. Then the first time of collision between the two flow 
processes, namely the first time the flow percolating from vertex 1 sees a vertex already visited 
by the the flow percolating from vertex 2 (or viceversa) gives the shortest path between the two 
vertices. Let z"''^' and 2;"''^' denote the number of vertices seen by the flow cluster by time t 
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for the flow emanating from vertex 1 and 2, respectively. For large n, the flow clusters look like 
independent versions of the CTBPs as formulated in Section 1.2, at least until they collide. A 
coupling is rigorously formulated in Sections 2.1.1 and 2.1.2. Further, they collide only when 
both clusters reach size Qp{^/n). At a heuristic level, at any time t, the rate of collision 7n(t) in 
a small interval [t, t + dt) should be 

/ n,(i) n,(2) \ 

7n(t)oc r^ Ut. (1.17) 



Now we use the fact that for large t, z"''"' ~ W^^^e^^, where VF^*' is the limiting random variable 
for the associated CTBP deflned in (1.7), to see that 

7n.(x) ^ . (1.18) 

n 

Thus collisions happen at time (2A)~^ log n it Op(l), where Op(6„) denotes a random variable Xn 
for which \Xn\/bn is tight. If we let T12 denote the collision time, then the length of the optimal 
path equals Wn = 2Ti2. The above argument gives asymptotics for the collision time and hence 
the length of the optimal path. 

For the hopcount, we shall use general branching processes arguments to show that at large time 
t, if one is interested in the distribution of the generations (in our context this gives the number 
of individuals at various graph distances away from the root, namely the originating vertices 1 
and 2), the contribution to the population comes from generations t/ f3{s) and the deviations are 
normally distributed around this value. Here the constant /3(s) > denotes the mean of the 
stable-age distribution of the associated branching process. Intuitively, the optimal path between 
vertex 1 and 2 as constructed via the above simultaneous flow picture looks like the following: 
Suppose the connecting edge between the two clusters (fi,f2) arises due to the birth of a child 
to vertex v\ in the flow cluster of vertex 1 and this child, V2 has already been visited by the flow 
from 2. This happens at around time (2A)~^ logre±Op(l). The hopcount of the optimal path 
is given by the equation 

Hn = Gi + G2 + l, (1.19) 

where Gi and G2 are the generations of vertex vi and V2 in flow cluster 1 and 2, respectively. Thus 
understanding the distribution amongst generations in the coupled branching processes paves the 
way to understanding the hopcount. The remainder of this paper involves the conversion of the 
above heuristic into a rigorous argument. The organization of rest of the paper is as follows: 



• In Section 2.1, we shall couple simultaneous flow from two vertices on /C„ with CTBPs and 
show that the difference is negligible; 

• Section 2.2 shows that the above coupling incorporated with technical results from CTBP 
theory give us asymptotics for the re-centered length of the optimal path, namely Theorem 
1.1. 

• Section 2.3 shows how the distribution of individuals among different generations in the 
associated branching process proves Theorem 1.2. 

• Finally, Section 3 proves all the CTBP results we need to carry out our analysis. This 
section is the most technical part of the paper and the point of organizing the paper in this 
fashion is to motivate the various results that are proved in Section 3. 
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2 Proofs 



This section proves the main results. Proofs of the CTBP results needed are deferred to Section 
3. 

2.1 Dominating graph flow by continuous-time branching processes 

In this section, we describe a coupling between the flows started from vertices 1 and 2 and 
their corresponding independent CTBPs with offspring distribution given by the point process in 
(1.6). We shall first start with the fiow started from one vertex and then extend this to the flow 
simultaneously from two vertices. 

2.1.1 Expansion of the flow from a single vertex 

We start with some notation. Recall that /C„ denoted the random disordered media represented 
by the complete graph where each undirected edge (i,j) has edge length Efj where Eij are i.i.d. 
exponentially distributed with mean n—1 (alternatively, with rate l/(n— 1)). These edge lengths 
make /C„ a metric space (with random geodesies). Let the index set of ICn be [n] := {1, 2, . . . , n} 
and fix vertex 1. Think of this vertex as an originator of flow of some fluid which percolates 
through the whole network via the geodesies at rate 1. Let ii = l,i2, ■ ■ ■ € [n] be the vertices in 
sequential order seen by the flow. For t > 0, let SWG^^' be the shortest-weight graph between 
vertex 1 and all the vertices that can be reached from 1 by shortest-weight paths of length at 
most t. More precisely, SWG^^' consists of these shortest-weight paths and the weights of all of 
the edges used for them. Let {-E'*}j>ij>i be a doubly inflnite array of mean 1 exponential random 
variables. Then, by the properties of the extremes of n — 1 i.i.d. exponential random variables, 
each with mean n — 1, it is easy to see that the neighbors of 1 have distances from 1 distributed 
as 

^M = m^(M^' + J^^2^')^■■■ (2-1) 

Similarly, the distribution of distances from vertex (the k^^ vertex reached by the flow from 1) 
to vertices other than those already seen by the fiow, is distributed as 

Call the above the immediate neighborhood process of vertex k. Note that for each k, by the 
memoryless property of the exponential distribution, the identity of the end point of each edge 
in the above point process is uniformly distributed among all [n] \ {ii,i2, ■ ■ ■ ik} vertices which 
have not been seen at the time when the fiow hits vertex i^. Our aim is to couple this process 
with a CTBP with offspring distribution given by the point process V defined by 

V = {{EiY, {El + ^2)^ {El + E2 + E^Y, . . .}, (2.3) 

where {Ei}i>i are i.i.d. exponential rate 1 random variables. Comparing (2.3) with (2.1) and (2.2), 
we see that, intuitively, the SWGJ^' should be stochastically smaller than the corresponding CTBP 
driven by offspring distribution V. The reason is that when the flow starts, then the number of 
edges it has to explore from vertex 1 is n — 1, but as SWGj^' increases with time, the number 
of edges originating from each new vertex is strictly smaller than n — 1 due to vertices already 
explored by the flow. Thus, the points are being depleted. We shall show that asymptotically for 
large n, the difference is negligible. To do so, as the flow explores /C„, we shall enlarge the graph 
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K-n with new artificial vertices to compensate for SWG^^' using up vertices in /C„ and effectively 
counteracting the depletion of points effect. For this, we shall need the following randomization 
ingredients: 

(i) The complete graph /C„ with random edge weights; 

(ii) An infinite array of i.i.d. exponential random variables {-E'i,j}ig[n],j>n.+i each with mean 
n — 1; 

(iii) An infinite sequence of independent branching process {BPi(-)}i>.„+i each driven by the 
offspring distribution in (2.3). 

Before diving into the construction, we shall need the following simple lemma which follows 
directly from the memoryless property of the exponential distribution: 

Lemma 2.1 (Powers of exponential distributions) (a) Consider the random variable 
where E has an exponential distribution with mean n — 1. Then, for any fixed r > 0, the condi- 
tional distribution of E^ \ E^ > r equals that of {E^ + r^^'^), where E is an independent random 
variable with exponential distribution with mean n — 1. 

(b) Consider the surplus random variable {E^ — r) \ E'^ > r. This random variable has the same 
distribution as the first point of a Poisson point process with rate 

kr{x) = . ^ A r + xf/'-^, x>0. (2.4) 



We shall use part (a) of Lemma 2.1 in the construction of the coupling while we shall use part 
(b) in the proof of the distributional result for the optimal weight. We start by proving Lemma 
2.1: 

Proof. Part (a) is immediate from the memoryless property of the exponential random variable. 
For part (b), we note that 

¥{E' - r > X \ E' > r) = ¥{E > {x + rf' \ E > r^/') = e-K^+r)'/'-r^^']/in-i) ^ (2.5) 

while the probability that a Poisson point process with rate (2.4) has no points before x equals 

Thus, the first point of this Poisson point process has the same distribution as the conditional 
law E^ — r \ E^ > r. u 

Construction of the coupling: This proceeds via the following constructs: 

(a) Artificial inactive vertices: Consider the flow traveling at rate one from vertex 1 on /C„. 
Let 2;"''^' denote the number of vertices in SWG^^^ To evoke branching process terminology, we 
shall often refer to this as the number of vertices born in the flow cluster of 1 by time t. For 
1 < k < n, we define the stopping times 

T^ = mi{t : zr = k}, (2.7) 

so that T" = 0. Now consider the fiow from vertex 1. For k > 2, when the k^^ vertex i^ 
is discovered by the fiow at time TJ}, create a new artificial vertex labeled by n + k — 1. Let 
a{ik) denote the vertex in SWG^i to which vertex i^ is attached. Then note that for all ij 7^ 



10 



a(ifc) G SWG™, by Lemma 2.1 and, conditionally on SWG^^', the edge lengths of edge {ij,ik) 

k 

have distribution {[t — T"]!/'^ + Ey where E has an exponential distribution with mean n — 1. 
For the new artificial vertex n — k + 1, we attach edge lengths from each vertex ij G SWG™ of 

■' ^k 

length {[t-T^f/' + Ej, 

n-k+iY where the Ej^n-k+i are exponential random variables as described 
in the randomization needed for the coupling, and where we recall that T" denotes the time of 
discovery of vertex ij. We shall think of the flow having reached a distance t — TJ^ on this edge. 
At the time of creation, we shall think of these artificial vertices as inactive as the flow has not 
yet reached this vertex. Think of these vertices as part of the network and the flow trying to get 
to them as well. Note that eventually the flow will reach these inactive vertices as well. Whenever 
the flow reaches an inactive artificial vertex, we shall think of this vertex becoming active, i.e., it 
is activated. Let At denote the set of active artificial vertices. For > 1, let 

Tj!'* := inf{t :\At\= k} (2.8) 

be the time of activation of the k^^ artificial vertex. Note that in this construction, edges exist 
only between vertices in [n] and artificial vertices, no edges exist between artificial vertices. 

(b) Activation of artificial vertices: Note that activation of inactive vertices happens at times 
T^'* via an edge from a vertex in SWCj^",* C [n] to an inactive artificial vertex > n+1. Suppose 
at this time the set of artificial vertices (active and inactive) is {n + 1, n + 2, . . . , n + j{Tj^'*)}- 
When this happens the following constructions are performed: 

(1) Remove all the edges from vertices in [n] to d^ (other than the one that the fiow used to 
get to it); 

(2) Create a new inactive artificial vertex n + j(T^'*) + 1. Just as before, create edges between 
each vertex i E [n] and vertex n+j(T^'*) + l with edge lengths distributed as ([t — T^'*]^/'^ + 
^i,n+j[Tj!-*)+iY and think of the fiow as having already traveled t — Tj^'* on it; 

(3) At this time, start a CTBP BPfc(-) with dk as the ancestor. The vertices born in this 
branching process have no relation to the fiow on /C„ and associated inactive vertices. For 
time t > T^'*, we shall call all the vertices in BPfc(i), other than dk, the descendants of 
vertex d^ at time t. 

Let VAt denote the set of all descendants of the associated CTBPs of active artificial vertices at 
time t and let 

BPf ^ = SWGf ^ U A U VAt. (2.9) 

Let Zj^' = |BP^^^| denote the number of vertices reached at time t. The following proposition 
identifies properties of the above construction which shall all be crucial in our analysis. We shall 
prove this proposition in detail since we later we shall use an almost identical proposition in the 
context of fiow from two vertices which we shall state without proof in Section 2.1.2 below. 

Proposition 2.1 (Properties of the coupling) In the above construction, the following holds: 
(a) The process {BP[^^}t>o is a CTBP driven by the point process V in (2.3). The process 
{SWGj^'}j>o is the shortest weight graph process of the fiow emanating from vertex 1. As is 
obvious from (2.9), there is stochastic domination in the sense that for all times t > 0, 

SWGf' C BPf'. (2.10) 
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In particular, Z- 



n,(l) 



ISWG 



(1)1 



< Z- 



(1) 



|BPf' I for allt. 



(h) Let A = \{s) he the Malthusian rate of growth o/ BPj^* as defined by (1.9). Then, given any 
e > 0, there exists > such that for times tn = (2A)~'^ logn — Cg 



liminfP(|A„| = 0) > 1 - e. 



(2.11) 



(c) For any fixed i? G M, letting tn = (2A)^"'^ logn + B, the sequence of random variable \ At,^ \ + 
is a sequence of tight random variables. Since the processes {\At\ + \T>At\)t>o Q'^e mono- 
tonically increasing in t, (2.9) implies that sup«(^(z 
n — )• oo, 



(1) 
t 



z^ ) is tight and, in particular, as 



sup 



n,{i) 

h 



1 



0, 



(2.12) 



Note that if |A,J = 0, then SWGf = BP^'^ for all t < i„, so that part (b) yields that there is 
little difference between the SWG and the CTBP up to time (2A)~^ logn — C^. 

Proof. Part (a) is obvious from construction. To prove part (b), note that by construction, 
if 2;"''^' = k, then the chance that the next vertex is an artificial inactive vertex is exactly k/n. 
Thus, if z^^^^ = kn then 

\AtJ = ^Ij, (2.13) 

i=i 

where Ij are Bernoulli j/n random variables. Now to choose Cg, first choose C* > so small 
that exp(— C*/2) > 1 — e/2. Since ^J^'^' < z'l]^ and for the process {z'l^^}t>o the asymptotics (1.7) 
hold, we can choose C* such that 

Fiz^f^ > C:V^i) < e/2. (2.14) 

Then, 

p(iAj > 0) < p(iAj > o,^:^^' < c:v^)+nz^f^ > c:v^) 

< (l-exp(-C;/2))+e/2 <e, 

where the second inequality follows using a Poisson approximation in (2.13) and (2.14). This 
proves part (b). 

Finally to prove part (c), we note the following: 



• Using part (b), we choose Cs so that with high probability no artificial vertices have been 
activated by time (2A)~^ logji — C^; 

• Using (2.13) and ideas similar to the above argument one can show that the number of 
active artificial vertices by time i„ = (2A)^^logn + B can be stochastically dominated 
with high probability by a Poisson random variable Xg with mean C{B) for some function 
B^C{B). 



These two observations together imply that with high probability 

\AtJ + \VAtJ ^st |BP.(^ - Cs)\, (2.15) 
i=i 

where BPj(-) are independent CTBPs driven by V, independent of Xg which is Poisson with 
mean C{B) and denotes stochastic domination. This proves part (c). ■ 
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2.1.2 Simultaneous expansion and coupling 

Let us now show how the above coupling can be extended to flow originating from two vertices 
1,2 simultaneously. We shall couple the flow to two independent CTBPs {BPj''}j=i^2- All the 
ingredients of randomness shall be the same as in the previous section, namely, (i) the com- 
plete graph fCn with random edge lengths; (ii) the infinite array of exponential random variables 
{-Bjj}(i<j<„.j>„_i_i); and (iii) the infinite sequence of independent CTBPs {BPj}j>i driven by V. 
Think of flow now emanating from the two sources 1, 2 simultaneously at rate one exploring the 
shortest weight structure about the two sources. We shall stop the flow when there is a collision, 
i.e., the flow from one vertex sees a vertex seen by the flow from the other vertex. As before, we 
let SWG^'^ denote the shortest weight graphs up to time t explored by the flow from each source 
z = 1 , 2 and let 

SWGt = SWG<'^ U SWGf . (2.16) 
Let z"'''' = ISWGf^l and = z"'*'' + Now let denote the stopping time 

T," = inf{i : = k}, (2.17) 

so that now T2 = 0. Let the vertex discovered at time and attached to one of the two flow 
clusters be i^ G [n]. We shall call this the time of birth of the vertex i^- Extra care is needed as 
subtle issues of double counting of edges may arise. 

The construction proceeds as before via two ingredients: 

(a) Artificial inactive vertices: By convention, we shall think of the edge between 1 and 2 to 
belong to the flow from vertex 1. To compensate at time 0, we shall add a new artificial inactive 
vertex labeled by n + 1. Compared to the other artificial vertices this shall be special in the sense 
that vertex 1 will not have an edge to this vertex (or the artificial vertices that replace this vertex 
when the fiow reaches this vertex). At time 0, attach an edge (2,n-|- 1) of random length E2j^^i- 
Now start the fiow from the two sources on the vertex set [n] U {n + 1}. The fiow percolates from 
these two sources on the (expanded) network discovering new vertices, both actual vertices in [n] 
as well as artificial vertices. Let SWGj denote this flow process with z^'* = |SWG*| and let 

f}: = mi{t:\zr\=k}. (2.18) 

Let ik denote the vertex discovered by the flow at time TJ} (this vertex could either be an actual 
vertex in [n] or an artificial inactive vertex). Create a new artificial vertex labeled hy n + k. Now 
if ik is in SWGf then remove all the edges between ik and all the vertices in SWG^^ (namely 

real vertices in the actual graph [n] which are part of SWGj^' that have already been explored 
by the flow from 2). (Do the exact opposite if ik G SWGj^\) The edges {v,ik) for v G SWG^^ 

are quite special (see the beginning of Section 2.2). Call these the potential connecting edges as 
these are the edges through which collision of the two flow clusters may happen. Also perform 
the following constructions: 

• If ifc 7^ n + 1 or any of the replacements of n + 1 (this term is defined below), then attach 
edges between the artificial vertex n + k and all ij G SWGy„ with edge lengths {[t — T^.]^/^ + 

Ei-^n+kY- The fiow would have already fiowed till distance {t — Ti.) on this edge to this 
new vertex. 

• If ifc = n+1, then replace this by a new vertex n+k. This vertex will be called a replacement 
of the special artificial vertex n + 1. Also replacements of such replacements shall be called 
replacements. Remove all edges from ij G SWGj.n to ik and add back edges from these 

vertices excluding vertex 1 to vertex n+1 with edge lengths {\t — T^]^/^+Ei . n+kY ■ This can 
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be understood by noting that the flow would have already reached up to distance {t — Ti. ) 
on this edge to this new vertex. 



Every new artificial vertex when it is born is inactive. Whenever the flow reaches an inactive 
artificial vertex we shall think of this vertex becoming active and belonging to the flow cluster 
from which this artificial vertex was reached. Let A^'^ denote the set of active artificial vertices 
corresponding to flow cluster i = 1,2 at time t and let At = A^^^ U A'j^^ be the set of artificial 
vertices. For /c > 1, we let 

T^"'* := inf{t : |A| = k} (2.19) 

be the time of activation of the k^^ artificial vertex. Note that, as before, in this construction 
edges exist only between vertices in [n] and artificial vertices, no edges exist between artificial 
vertices. 

(b) Activation of artificial vertices: Note that the flow will eventually reach inactive artificial 
vertices. When this happens say that activation happens. This happens at times Tj^'* via an edge 
from a vertex in SWGj-i.* C [n] to an inactive artificial vertex dk > n + 1 from one of the two 
flow clusters. When an artiflcial vertex gets activated, it belongs to the flow cluster that activates 
it and so do all its descendants (the notion of a descendant is defined below). Suppose that at 
this time, the set of artificial vertices (active and inactive) is {n + 1, n + 2, . . . , n + j(T^'*)}. As 
described above, this inactive artificial vertex is replaced by a new inactive artificial vertex with 
appropriate edges and edge lengths. 

Further at this time, start the CTBP BP,fc(-) with d^ as the ancestor. The vertices born in this 
branching process have no relation to the flow on /C„ and associated inactive vertices. For time 
t > T^'* we shall call all the vertices in BPfc other than df^ the descendants of vertex d^. 

Let "DA^^ denote the set of all descendants of the associated CTBPs of active artificial vertices 
at time t in fiow cluster i = 1, 2 and define the processes 

BPj"' = SWGf' U 4'' U VA['\ i = 1, 2. (2.20) 

Let zl"^ = |BPf'|. Finally let BPt = BPj" U BPf denote the fuH fiow process. This completes 
the construction of the coupling. 

The following proposition collects the properties of our construction that we shall need. It is 
analogous to Proposition 2.1 and we shall not give a proof. Recall that T12 denotes the collision 
time of the two fiow processes. 



Proposition 2.2 (Properties of the coupling) In the above construction, the following holds: 

(a) The processes {BPj''}t>o are independent CTBPs driven by the point process V in (2.3). The 
process {SWGj'^}o<t<Ti2 th^ shortest weight graph process of the flow emanating from vertex i 
till the collision time. As is obvious from (2.20), there is stochastic domination in the sense that 
for all times t > 0, 

SWC^^'cBPf'. (2.21) 

In particular 2:"'''' < z^'' for all t > 0. 

(b) Let A = \{s) be the Malthusian rate of growth 0/ BPj'' as defined in (1.9). Then, given any 
e > 0, there exists Cs such that for times tn = (2A)~^ logn — Ce, 

liminf P(ri2 > t^, U^l = 0, \A?^\ = 0) > 1 - e. (2.22) 
Note that if l^^^l = then SWGf = BPf^ for all t < t„. 

(c) For any fixed B £ W, let t* = (2A)^^ logn + B and let tn = T12 A t* . Then the sequence 
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of random variable | + | is a sequence of tight random variables. Since the processes 

{\At\ + \'DAt\)o<t<Ti2 ("•'^^ monotonically increasing in t, (2.20) implies that supj<j^(z^ 
is tight, and, as n ^ oo, 



(i) 



sup 

t<i„ 



(2.23) 



2.2 Analysis of the weight of the optimal weight 

Before proceeding to the main proposition in this section, we shall derive an important property 
of the above construction. When a vertex, say v £ [n], is born into one of the flow process (to fix 
ideas say into the flow cluster of vertex 1) at some time t, then note that the edges it has at this 
time are 

• edges to inactive artificial vertices. 

• edges to all vertices in [n] \ SWGj. 

For any vertex v G SWG^^^ and, for any vertex u € [n] born into the flow cluster originating 
from vertex 2 at some later time s > t, we say that the edge connecting v to u is assigned to 
vertex v and not to u. Similarly, if vertex u is born into the flow cluster starting from 2 before 
vertex v which is born into flow cluster from vertex 1, then say that the edge {u,v) is assigned 
to vertex u. Now for any time t and any vertex i G SWG^^' C [n], let Nt{i) denote the number 
of edges with end points in SWG^^' which are assigned to it. Similarly, for a vertex i G SWGJ^\ 
Nt{i) is the number of vertices in SWG^^^ assigned to it. Recall that our aim in sending the flow 
simultaneously is to analyze the collision time, namely, the first time when an edge, which we 
shall refer to as the connecting edge, forms between the two flow clusters. For any given time t 
and V G SWGj'\i = 1,2, define the (random) set 

Mt{v) = {u€ SWGf"'' : edge {u,v) assigned to v} = {u € SWGf"'' : r„ > T^}, (2.24) 

where, from now on, we shall use to denote the time of birth of vertex v into the flow process 
{SWGt}t>o and we recall that SWGt = SWGf' U SWGf . 

The importance of these connecting edges is as follows: Fix some time t and vertices i G SWG^^^ 
and j G SWGJ^' with Tj > Ti so that the edge between them is assigned to vertex i. Note 
that up till time Tj, the flow was proceeding on the edge between them at rate 1 from vertex 
j. Now at time Tj the flow has reached the edge from the opposite side (i.e., from vertex j) 
and is proceeding through the edge from both end points. Thus while the flow through all other 
non-potential connecting edges proceeds at rate 1, the flow through this edge proceeds at rate 2. 
For any time x + Tj, and using Lemma 2.1(b) with r = Tj — Ti and the fact that the flow now 
proceeds at rate 2 and not 1, the intensity function for the formation of this edge at this time is 

A(,,,)(x + T,) = j^^-^{{Tj - T,) + 2x)V-i, x>Q. (2.25) 
In particular, for t > Tj, 

W = - + - T,))"'-' = J^^^iit - T^) + {t- T,))V-i. (2.26) 
This fact leads to the following proposition: 
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Proposition 2.3 (Collision time distribution) If T12 denotes the collision time, then with 
respect to the filtration generated by the flow process, T12 has the same distribution as the first 
point of a Poisson point process with rate function given by 

AnW = ^(;^ E E {[t-T,] + [t-T,]f^-K (2.27) 
jeswG^^' ieswcf ' 



Remark 2.4 (Extension to other graphs) Note that a similar formula as the above remains 
valid for any finite graph with i.i.d. El edge weights where are exponential random variables, 
where the sum over {i,j) is restricted to € En, i.e., the sum is only taken over the edges of 
the graph. This will be used in our analysis of more general random graph models. 



Proof. Using (2.25), Lemma 2.1 and the fact that for a finite number of independent Poisson 
point processes, the first point to occur in any of these processes has the same distribution as 
the first point in Poisson point process with rate given by the sum of rates of the corresponding 
point processes, we have that 

oY- {[t-n + [t-T,]Yl^~^ ^ ^ ([^ _ T,] + [t - r,-])V-^ 

(2.28) 

where we recall that Mt{i) denotes the set of vertices in the other flow cluster assigned to i. Now 
note that for every pair of vertices {i,j),i G SWGf\j G SWGf^ either i e Mt{j) or vice-versa 
and only one of these facts can happen. Rearranging the above equation gives the result. ■ 

In Section 3.3 below, we shall prove the following result concerning the convergence of the two- 
vertex characteristic: 



Theorem 2.5 (Convergence of two-vertex characteristic for BP) Consider two indepen- 
dent CTBPs BP['\ i = 1, 2, as before. Let W^'\i = 1,2 be the almost sure limits ofe ^^z^\ Then, 

e"^^* E E - ^il + - ^^\)^''~^ ^ XW^'^W^^\ (2.29) 
where Wi are the a.s. limits o/e~'^*|BPj''| and are i.i.d. with the same distribution as W in (1.7). 



Now we are ready to prove Theorem 1.1: 

Completion of the proof of Theorem 1.1: First consider the rate function A„(t) of the 
collision time given in Proposition 2.3. By Proposition 2.2, in the summation arising in this 
rate function, we can replace the terms SWG^'^ by BP^*^ as the effect on the rate function is 
asymptotically negligible, where BPj'^ are the independent CTBPs that have been coupled with 
SWG^'^ to understand the optimal path on /C„. Note that while these CTBPs intrinsically depend 
on n since we have used the randomization in /C„ to construct the CTBPs. By (1.7), 



e-^*|BP 



(i)| 



(2.30) 



where Wn^ are independent and identically distributed as the limit variable in (1.7). 
Now, Theorem 2.5 implies that for any fixed B > and any x G [—B, B], we have. 



sup 



2A 

A„((2A)-i logn + x) - — l^w^^2)e2Ax 

s 



0. 



(2.31) 
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Comparing the above with the definition of the Cox process in (1.10) completes the proof subject 
to Theorem 2.5. Theorem 2.5 is proved in Section 3.3. ■ 

For future reference we define the two-vertex characteristics X(jj)(0 by 

Xi^,){t) = {[t-T,] + [t-T,])'/^^K (2.32) 

We shall now quickly prove the distributional equivalence (1.12). 

Lemma 2.6 (The limit of the shortest weight) The first point H*^^ of the Cox point process 
with rate 7(-) as in (1.10) satisfies the distributional equivalence in (1.12). 

Proof. Since H'^' is the first point of the Cox process with rate function 7 in (1.10), we have 
for any fixed ?/ G M, conditional on W^^\ W'''^\ 



y\W^-'\W^^^) = exp ^{x)di^ = exp (^--W'^'^W^^^e^^v^ , (2.33) 



so that 

1 VF'^'P^'^' 



(14/ V ; 1/1/ \ 
> X - 4 log 1^-— — I W'^'\W^^A = exp(-e2^^) = P(G/(2A) > y), (2.34) 
2A s / 

where G has the standard Gumbel distribution. This proves the result. ■ 



2.3 Hopcount analysis 

As before, we let T12 be the collision time between the two flow clusters and suppose the collision 
happens via the formation of an edge (fi, "^2) where vi G SWG^^'^ and V2 G SWG^^'^. For i = 1,2 
let Gi denote the number of edges on the path from vertex i to Gi so that the hopcount is given 
by 

Hn = Gi + G2 + l. (2.35) 

To prove Theorem 1.2 it suffices to show that, for every fixed, v^x^y G M a-iid writing — 
(2A)-ilogn, 

P {Ti2 <tn + r,Gi< Xstn + xs^/Xt^, G2 < Xstn + ys^/xC) ^ Fi2(r)$(x)$(y), (2.36) 

where Fi2{-) is the distribution of the random variable appearing in Theorem 1.1 and $(•) denotes 
the standard normal distribution function. 

For fixed time t and v G SWGj'\i = 1,2, let G{v) denote the number of edges in the optimal 
path between v and vertex i which started the flow. For any fixed x G M, let 

SWGf>(x) = {ve SWGf : G{v) < Xst + xsVxi}. (2.37) 

By Proposition 2.3 and properties of a finite number of Poisson processes, we have, for any fixed 
t, 

P(Ti2 e[t,t + dt), Gi < Xst + xsVxi, G2 < Xst + ysVxi \ SWG*) 

' KMd^) A„(t) ^g.^"°!' ,t. (2.38) 

where Xijit) was the two- vertex characteristic defined in (2.32) and Xn{t) was the rate defined in 
(2.27). Thus, to complete the proof of (2.36), it is enough to show the following: 




17 



Theorem 2.7 (CLT from two-vertex characteristic) The two-vertex characteristic satisfies 
the asymptotics, for t — t- oo, 

Theorem 2.7 is proved in Section 3 and completes the proof subject to Theorem 2.7. In fact, 
together with Theorem 2.5, (2.38) proves the joint convergence of the length of the optimal path 
and the hopcount as remarked upon below Theorem 1.2, where the limits are independent. m 



3 Continuous- time branching process theory 

In Sections 2.2-2.3, we have reduced the proof of our main results to the proof of Theorems 2.5 
and 2.7. This section deals with a proof of Theorems 2.5 and 2.7. This section is organized as 
follows. In Section 3.1, we investigate properties of our CTBP. In Section 3.2, we investigate one- 
vertex characteristics. In Section 3.4, we analyze the two- vertex characteristic and prove Theorem 
2.5. In Section 3.5, we derive a CLT for the two-vertex characteristic and prove Theorem 2.7. 

3.1 Intensities and limiting parameters single CTBP 

We shall first state and prove various results that we shall require regarding a single branching 
process. Let BP be a continuous-time branching process driven by the offspring point process V 
(i.e., the points given by {Li,L2, . . .) as in (1.5)) and let n denote the mean intensity measure of 
this point process, i.e., 

fi[0,t]=Emi:U<t}). (3.1) 

Now, 

Define the Malthusian rate of growth A = A(s) as the unique positive constant such that the 
measure 

u{dt) = e-^^^i{dt). (3.3) 

is a probability measure. A simple computation shows that this is equivalent to (1.8). The 
following lemma collects some properties of this probability measure and the constant A: 

Lemma 3.1 (Identification of limiting parameters CTBP) 

(a) The constant A = A(,s) is given by (1-9). 

(h) The probability measure u[dt) is a Gamma distribution with density 

(c) Let Pi and /32 denote the mean and the standard deviation of v. Then 

A = {s\r\ h = i^Vsy'. (3.5) 

(d) Let /i*-' denote the j-fold convolution of the measure fi. Then 

r(j/s) 
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Proof: To prove part (a), note that since the sum of k independent exponential random variables 
follows the gamma distribution, a simple computation gives that 



oo 1 /"OO J.I 

L I \j-s _-i V ^ L 



1 = Ve e-^^0 = V / e-^*^e-*-^dt = / e'^^^e"* V -^dt 

ti ^ ^ {^-ly. Jo Ui^-^y- 

OO /"OO /"OO 



JO 

= A-i/^r(i/s)/s = A-i/^r(i + 1/s), 

as required. Parts (b) and (c) are trivial. To prove part (d) note that, by (3.2) and [12, 4.634], 
(du) = dus~^ / ^'""^y^ ^dui---duj (3-7) 

J Ul-\ \-Uj=U 

_ u^/'-^s~^T{i/sydu _ uJ'/«-ir(i + i/sydu _ u^/'-^\^/'du 

- njjT) - f{jjs) T{j/s) ■ 



3.2 Analysis of single-vertex characteristic 



We first state a general theorem for single vertex characteristics of the CTBP. Consider a function 
X'- — >• R"*" which is continuous almost everywhere which (a) increases at most polynomially 
quickly to oo; and (b) is integrable with respect to the Lebesgue measure near zero. Let us call 
such functions regular single-vertex characteristics. For the branching process BPt, call 

= E - ^^•) (3.8) 

iGBPt 

the branching process counted according to characteristic x- Branching processes counted by 
characteristics are some of the fundamental objects studied by Jagers and Nerman, see e.g. [18]. 
For example, taking x{x) = 1, we obtain zf = |BPt|, the size of the branching process at time 
t. In order to investigate the hopcount, we will need to analyze not just branching processes 
counted according to characteristics as above but also generation weighted characteristics. Given 
a regular single vertex characteristic x any fixed a € M, define 

z}{a)= ^ a^(^\(t-r,), (3.9) 

j6BPt 

where, as before, Tj denotes the time of birth of vertex j, while G{j) denotes the height or 
generation of vertex j. Given any characteristic Xi when we write zf without the argument a, 
we imply the branching process counted in the usual way as in (3.8), while when we have an 
argument a, namely zf{a), we refer to the branching process counted by a generation weighted 
characteristic as in (3.9). 

The following proposition is adapted from the general theory of CTBPs, see e.g. [17, Theorem 
5.2.2] for part (a) (or see the nice treatment in [18, Theorem 3.4]). We shall give a complete 
proof since branching processes counted by generation weighted characteristics have not been 
previously analyzed. These constructs shall be crucial for us in order to prove the CLT for the 
hopcount. 



Proposition 3.2 (Mean and co-variances of one-vertex characteristics) For regular de- 
terministic single-vertex characteristics Xj 
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(a) the expectation m^(a) = E[z^(a)] satisfies 



mna)=E[zf{a)] 



rt °° 

■^0 ,=0 



(3.10) 



(b) the covariances between zp(ai) andzf'^{a2) satisfy 



Jo 



where v i— t- /iai,a2('y) "i^ function 



•5 Jo 

an(i we define the generation-weighted intensity measure fia by 



•5 Jo 



fiaidu) = i^L*^{du). 
j=0 



(3.11) 



(3.12) 



(3.13) 



Proof. The proof of part (a) follows the same strategy as in [18, p. 228], where the case 
a = 1 was proved. Indeed, there it is shown that the intensity measure for individuals in the k^^ 
generation equals /i*'^. Thus, 



mi 



{a)=E[zna)] = J2^'^[j2^(*-^^)] = / x(i-n)^aV'('in)- (3.14) 



For part (b), we follow the identification of Var(2;^) in [18, Theorem 3.2 and Corollary 3.3]. We 
use the covariance partition 

Cov(zf (ai),zp(a2)) = Cov(E[zP(ai) | Ao],E[zf{a2) \ Aq]) + E [Cov(zP (ai), zf (aa)) I Aq], 

(3.15) 

where ^o is the cr-algebra generated by the lives of the individuals in the first generation (the 
root is considered to be in generation zero). Then, the branching property of a CTBP gives that 



zf{a) = x{t) + a ^t-T,(i;' 
j-- G(i)=i 



(3.16) 



where ((z^ (j; ai), (j; 02)))^- is, conditionally on ^o, a sequence of i.i.d. random processes 
with law ((^P (ai), zP(a2)))j>o- Therefore, 



Cov(zf (ai),zf (02) I Ao) = aia2 ^ 0^^^^.2(01,02), 

j--GU)=i 



where we abbreviate 
Thus, 



Cf '^^^(01,02) = Cov(zf (oi),zf (02)). 



E 



Cov{zf'{ai),zf{a2)\Ao) =0102/ C^i;^'(ai, 02)/i(dn). 



Further, 



E[z,^(o) I Ao] = x{t) + a fml^Vidu), 

Jo 



(3.17) 

(3.18) 
(3.19) 

(3.20) 
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where {V{t))t>o is the intensity process of the first individual. Therefore, we arrive at 

•>^2(ai,a2) = ha,.a,{t) + (aiaa) T C»;^'(ai, a2)/i(dn), (3.21) 

Jo 

where 



t pt 



ha,,a2{t) = aiasCov ^ m^^„(ai)P(dn), m^Z Ja2)P((iu)J . (3.22) 
Iterating this equation yields (3.11). 

As before, for V denoting the off'spring distribution point process (given by (1-6)) and for every 
function F : M — ^ M, note that 

/ F{x)V{dx)^ f{Yi), (3.23) 

JO 

where 11 is a rate 1 Poisson point process, f{x) = F{x^) and where the function / applied to a 
point process 11 is defined as /(H) = ^xen fi-^)- properties of functionals of the Poisson 
point process, we have that 

(ft ft \ ft^/" ft 

/ Fi{u)V{du), I F2{u)V{du)] = / Fi{u')F2{u')du = s'^ j u^''^^ Fi{u)F2{u)du. 
Jo Jo J Jo Jo 

(3.24) 

Therefore, we obtain 



ha,,a,{t) = ^ u'^'-'mf\{a,)mflMdu. (3.25) 
■s Jo 



The following proposition (adapted mainly from [18, Theorem 3.5]) captures all we require to 
know about the asymptotics of the mean and variance of a single-vertex characteristic x- 

Proposition 3.3 (Asymptotics of mean and variance for one-vertex characteristics) For 

regular single vertex characteristics X; o-i^d all a > 0, 
(a) As t ^ oo, 

e-^"'*E(zj^(a)) ^ a's\ / e-^'''yx{y)dy. (3.26) 



JO 

When a = at —7- 1, then the convergence holds where in the right-hand sides the value a = 1 is 
substituted. 

(b) As t ^ oo, when ai, 02 > with af + a| — af a| > 0, 

e-^('^i+'^i)*Cov(zf (ai), (as)) (3.27) 
(aia2)'+^A2s3 '■'^ 



(af + ai)V«(af +a|-afa|) Jo 



poo 

)e-^''i^dx / X2(x)e-^'^2^dx. 
Jo 



When a = at ^ (1, 1), then the convergence holds where in the right-hand sides the value a = (1, 1) 
is substituted. 

(c) With zt = |BPj|, there exists a random variable W with W > ^ a.s. such that e~^^zt converges 
almost surely and in to W and further for any single-vertex regular characteristic 

poo 

^_xt^x ^ / ^(a,)e-^^dx, (3.28) 
^0 

and the convergence also holds in L^. 
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Proof. Part (a) for a = 1 is [17, Theorem 5.2.8]. For a 7^ 1, we start from 

nt °° ft 

m^{a) = / x(i - n) V a''^i*Hdu) = xit - u)^ia{du). (3.29) 

^0 ._n ^0 



Define the measure with density pa(M)dii via the equation e ^°'"^fia{du) = pa{u)du+e ^"-'^doidu), 
then we obtain 



(3.30) 

By Lemma A.l(a-b), we have that Pa{u) is uniformly bounded on [1, 00) and bounded by cu^^^~^ 
on [0, 1], while and Pa(u) — )■ a* As when u — )• 00. Thus, by dominated convergence, 

/•oo 

e-^^''m^{a) ^ a" As / x{v)e-^''^''dv. (3.31) 



The proof when oj — t- 1 is identical. 

See [18, Theorem 3.5] for parts (b) for ai = 02 = 1 and for part (c). For the proof of part (b) for 
(01,02) 7^ (1, 1), we start with (3.11) and (3.12). By part (a), we have that 

r^^M^'iai)M^^{a2) /"°° u^/^" V^^^i+^^^^du, (3.32) 
s Jo 

where we define 

I'OO 

M^{a) = a'sX e-^^^^' xiy)dy. (3.33) 



00 



Further, note that, by (1.9), 

^i/.-ig-A(af+ai)«^^ = (al + a|)^i/"A-i/"r(l/s) = s{al + a|)-^/^ (3.34) 
Then we rewrite 

e-^("^+"^)*Cov(zf (ai),zP(a2)) = re-^K+'^i)(*-)/j^^^„^(t _ n)e-^('^i+'^2)-;i„^^^(dn). (3.35) 

Jo 

Now, by (A. 2), for u large, 

e-^("i+"2)"/iaia2(dtx) ~ (aia2)''Ase-^("?+"2-'^i'^2)«d-u, (3.36) 

which is integrable, so that substitution of this asymptotics in (3.11) and using dominated con- 
vergence, proves that 

POO 

e-^("i+'*2)tCov(zf (ai),zf (02)) ~ s{aia2)^+'M^^{ai)M^^{a2)ial + a'^y^^' / Ae-^^'^i+^^-'^i'^^^^dn 

Jo 

(aia2)i+^s 



(af + a|)i/^(af + a|-afa|) 



M^i(ai)M»(a2). (3.37) 



This proves the claim when +a| — 0^05 > 0. When a = at ^ (1, 1), then the above asymptotics 
holds with a = (1, 1) substituted on the r.h.s. since (3.32) holds with a = (1, 1) substituted on its 
r.h.s. ■ 
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3.3 Almost sure convergence of two-vertex factor: proof of Theorem 2.5 



In this section, we prove Theorem 2.5. Throughout the proof, we shall abbreviate p = 1/s — 1. 

,(1.2) _ T(l)f^ , 7'(2)CR^ _L r(3)^ 



Note that, for any fixed < e < -B < 00, we can write z^^'^^ as 



.r'=/r^(e,i?)+/r(i?) + /f(e), (3.38) 



where 



n'\B)= E {[t-T^] + [t-T,]r, 

J eBpf ' -.t-Tj >B ieBP^^' 

4''(^)= E E {[t-T^] + [t-T,]r. 

jeBPf>:t-T,<eieBP^^> 

Thus to prove the result it is enough to show that for each fixed e, B we have 

/B POO 
J (xi+X2)^/'-^e-^^ie-^^2dxi(ix2, (3.39) 

lim sup lim sup e"^^*/f' (B) = and lim sup lim sup e"^^*/f ' (e) = 0. (3.40) 
We shall start by proving (3.39). The following lemma shall be crucial in our proof: 

Lemma 3.4 (Sup convergence of characteristics) As t ^ 00, 



sup 

xe[e,B] 



i-oo 

e~^* V {x+[t-Ti])P -W^'^X ix + y)Pe-^ydy ^0, (3.41 

^0 



i6Bp(^) 



where W-^^ is the almost sure limit of e ^^z^^K 

Proof. Consider the (random) functions 

ft{x) = e-^' Y {x+[t-T^y xe[e,B]. (3.42) 

iSBPf) 

Note that for p < these functions are monotonically decreasing, while for p > they are 
increasing functions and they are all continuous when defined on the compact interval [£,B]. 
Further, for each fixed x £ [£,B], by Proposition 3.3(c), pointwise we have on a set of measure 
one, 

poo 

ft{x) ^ W^'^X / (x + yfe-^ydy. (3.43) 
Jo 

Thus to show the a.s. sup convergence, by the Arzela-Ascoli Theorem (see e.g. [21]), it is enough 
to show that the above family of functions are a.s. equicontinuous, i.e., for any x G [e, B] and any 
given 5 > there exists r]{x) > independent of t such that for all t: 

\ftix) - ft{y)\ < 6 forallyG [x-?7(x),x + 7?(x)]n[e,S]. (3.44) 

We separate between the cases p < 1 and p > 1. 



23 



Case 1: p < 1. In this case note that for any /i, Z2 > and a > 0, we have 
m + ay - ih + ay\ = \p-l\ t\x + aY-^dx 



rh 

<\p-l\ / x^^^dx since p - 1< 0. (3.45) 
Jh 

By the continuity of the function g{x) = x^ , for any x G [£,B] and 5' > 0, we can choose "//{x) 
smah such that for y £ [e, B],\y — x\ < r]'{x) we have 

\xP-yP\<6'. (3.46) 

This imphes from (3.45) apphed individuahy to the functions gi{x) = {x + [t — Tj])^/*"-*^ for 
i G BPf ^ that 

\ft(.x)-My)\<6'e-^'zl'\ 

where we recah that zl''' = |BPf^|. Since e ^^z^^^ converges a.s. and {e '^*z^^'}t>o is bounded a.s., 
we obtain that, on a set A of measure one, for each a; G ^, we can find a k{u}) depending on the 
sample point uj but independent of t, such that 

supe~'^*zf \w) < k{uj). 
t 

Now choosing 6' = 5/k{uj) gives us a r]{x) = rj{x,uj) such that (3.44) is satisfied. This proves the 
result for p <1. 

Case 2: p > 1. Here note that for any a > and x,y £ [£,B], we have, by the mean value 
theorem 

\{x + ay — {y + ay\ = p{z + ay~^\y — x\ a G [x, y] 
< p{B + ay~^\y — x\ since p — 1 > 0. 

This implies that, for x,y £ [£,B], 

\f,{x)-My)\<Ht\x-y\, (3.47) 

where, by Proposition 3.3(c), 



PCX) 

Ht = e-^'p V {B + [t - Ti])P-^ ^ pW^'U {B + yy-^Xe^^^dy. 
(1) -^0 

This proves that (3.44) holds also when p > 1, and completes the proof of Lemma 3.4. 



(3.48) 



Completion of the proof of (3.39). Write 

BPf (e, B) = {ve BPf ' : e <t-Tj < B}. (3.49) 

Then we have 

/•oo 

I^'\e,B)-e-^' W^'^ {[t-T,] + y)'/^-'Xe-^ydy <Q^{t)e-^'zi'\ (3.50) 

Jo 



jGBPf\e,B) 
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where 



Qi{t)= sup e"^* V (x + [t-Ti])!/^-! -lyw / {x + yf/'-^\e-^ydy. (3.51) 

Lemma 3.4 now implies that the term on the r.h.s. of (3.50) converges to a.s. Thus, to complete 
the proof, it is enough to show that 

poo pB poo 

,o, Jo Je Jo 



This follows by taking the characteristic 

' f^{a + X2ye-^''^dx2 if e<a<B, 



X2{a) 







if a ^ [e,B]. 



and using Proposition 3.3(c) for the branching process BPj^'. 



(3.52) 



Completion of the proof of (3.40). First consider the term (e). Note that 



(3.53) 



(1) 



Now note that by Proposition 3.3(c) 



{e + t- Tjf ^ \W^^^ [1 - e~^'] 



(1) 



jeBP^ 



when e 4, 0. This proves the last convergence result in (3.40). 



/ {e + yfe-^ydy 
Jo 



i>0, 
(3.54) 



To prove the first convergence result in (3.40), note that arguing as in the proof of (3.39), we 
have for all p and xi,X2 > 0, 



ixi + X2r <{2Pvl){xl + xP), 



where aV b = max{a, b}. Thus, 



{l)^XS,(2) I ,X,{1)^{2) 



(3.55) 
(3.56) 



where xb{x) = {B + x)^, xi^) = Now again, by Proposition 3.3(c), 

e-^*[zf'z,^_f/' + ^ VF^iyf^'e-^^ / Q-^^'iB + xfe-^^dx + i e-^'^x^e-^'^dx 



'0 ^0 
which converges a.s. to when B — )• 00. This completes the proof of (3.40) 



(3.57) 



3.4 Mean and variance of two-vertex characteristic 

In this section, we shall analyze two-vertex characteristics. This sets the stage for the proof of 
the the asymptotics for the hopcount in Theorem 2.7. Define, for a = (01,02), 



(3.58) 



where we recall that G''^(u) is the generation of vertex v G BPj . 
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Lemma 3.5 (Expectation and variance of two-vertex characteristics) Consider two in- 
dependent CTBPs BPj^^ and BP^^^ driven by the offspring distribution V . Then, 
(a) 



nzf''\a)] = [ [ {[t-v] + [t- u]rflaAdv)fia,{du). (3.59) 
Jo Jo 

(b) 



t ft 



"'^ Jo ""'^ 



CoY{zf''\a),zf''\h))= I h%{t-u,t)ilaMdu)+ I hf.{t-u,t)iia,i,,{du), (3.60) 

where 



h%{v, t-^^^ r r r f f ^P^^^ -U2] + [v-U- U^]Y{[t -V2] + [V-U- VlW 

s Jq Jo Jo Jq Jq 

X fiaiidui)fibiidvi)ila2{du2)fib2idv2)du, (3.61) 

(2) ,-(2) 



T i-V -(2) -(2) 

/,(^) t) = ^ nPE[z„^^;— (a2)^;^r"'^ {b2)]du, (3.62) 

a,o s Jq 

with p = 1/s — 1 and 

Xila.i^) = fit - ^2 + xYfLa,{dU2). (3.63) 

Jo 



Proof. We shall prove part (a) by conditioning on BPj^'. Note we can write z^^'^^a) 



zf''"^'' '(02), where 



Xt 



(i)(x)= CLf'''^^\x + [t-Tj\Y. (3.64) 



ai ' 



Conditionally on BP^^\ the characteristic x^tai deterministic. Therefore, Proposition 3.2(a) 
implies that 

E(zr'(«)|BPr)= E <h'''^'U\[t-u] + [t-T,]Y~^aAdu) = zf^-''\a,), (3.65) 

Jo 



where xfii characteristic 



X?L{v)= I {[t-u]+vYi:La,{du). (3.66) 







We complete the proof by noting that, for all r, 

m?*-^(ai) = Eiz^'-"^'' '(ai)) = / {[r - v] + [t - u]YiiaAdvr^aAdu). (3.67) 



JO JO 

Taking r = t proves the claim in part (a). 

For part (b), we again condition on BP^^\ for which we use the covariance partition 

Cov{zf^'\d),zY'\h)) (3.68) 
= Cov(E(zf •^'(a)|BPf'),.f •^H6)|BP«)) + E(Cov(zf 4^''' (&)|BPr)) = {I)t + (//)*• 

Let us now tackle each of these two terms separately. 
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Term For {I)t, we use the explicit formula for E(z^^'^'(a)|BPj^') in (3.65) and Xt^i2 ™ 
(3.66) to obtain that 

(/)* = Cov(zf-^'''(ai),zf*'^^'' (6i)). (3.69) 
Now using Proposition 3.2(b), we get 

{I)t= f [ h^^l{t-u,t)lia,b2{du). (3.70) 



Jo 



where v ^ h Uv, t) is the function 

a.b 

I (2) (2) 

(^, t) = ^ I u^ml'-;} {ai)m^,!:l' {h)du (3.71) 



s Jo 

02^2 r 



v—u rv—u 



■5 Jo Jo Jo 
0262 



s 




uP{[t -u2] + [v-u- ni])P([t - ^;2] + [t; - n - vi]^ 



JO JO JO JO 




X jla^ {dui)fjihr {dvi)fia2 {du2)iJ.b2 {dv2)du. 

12 ^ X^i '{2) 

Term {II)t- We again use that, conditionally on BPj^\ zf''^\d) = (02); where x^tai was 

defined in (3.64). Therefore, we can again use Proposition 3.2(b) to write 

Cov(zl^'^'(a),4^'^'(6)|BPf') = f g.^^{t-u,t)fi^2b,idn), (3.72) 







where 

Therefore, 
where 

We can now rewrite 

,(1) 



5s6("'*) = / ^''"^v'~uia2)m^!:u{bi){b2)du. (3.73) 



s 







rt 

{II)t= I hfJ,t-u,t)fia2b2{du), (3.74) 







T /-D (1) (1) 

h^^l^{v,t) = ^ I nPE[m^X'(«2)mJ^^i(62)]dn. (3.75) 



s 



^'"n«2)=/" V af"(^)(t-t.2 + t-T,)P^,,(dn2) (3.76) 
Jo 

/T (2) 

V af"(^'W (t-^/2 + t-r,)^/ia.(dn2) = zf"-"'^''\ai), 
M> ^0 



where 



X^!r,a2(^) = - ^2 + x)^'/i,,(dn2). (3.77) 

JO 

This completes the proof. ■ 



27 



Lemma 3.6 (Asymptotics of mean and variance of two-vertex characteristics) Consider 

^ and BP^ 



two independent CTBPs BP^^^ and BP^ driven by the offspring distribution V. Then, 



(a) 

poc poo 

e-^('^i+"2)*E(zf '''(a)) ^ (As)M / {xi + X2Te-^^<'''+'''^''^Uxidx2. (3.78) 

Jo Jo 

When a = at ^ (1, 1), then the convergence holds where in the right-hand sides the value a = (1, 1) 
is substituted. 

(b) When of + — ofal > ^''^d 6f + 61 ~ ^1^2 > there exists a constant ^cov(o5 b) such that 

g-A[(af+ai)+{6f+6i)]tCov(zf''>(a),Z,'^-'H6)) ^ AcoMM- (3.79) 

When a = at — )• (1,1), 6 = bt — )■ (1,1), then the convergence holds where the r.h.s. is replaced with 
AcoAt 1). 

Proof. By Lemma 3.5(a), 

g-A(a?+ai)ijE(^(i,2)(^)^ = e-^('^l+'^^)t f f\[t -u] + [t- v]fl'-^fLaAdu)fLa,{dv) (3.80) 

Jo Jo 



t rt 

e 



JO 
t rt 



X[al[t-u)+al(t-v)]^^^ - tx] + [t - v]f/'-^PaAy)PaAy)dudv + o(l) 
g-AKn+ai.]^^ + vf'-^PaAt " u)pa,{t - v)dudv + o(l), 

'0 JO 

where the o(l) originates from part 5u,o{du) in the decomposition e~^°-"^jla{du) = pa{u)du + 
e~'^°^'"(5o(iitt), and 5ufi is the Dirac measure at n = 0. Now we again use Lemma A. 1(a) and (b) 
together with dominated convergence to obtain the claim in part (a). The proof of part (b) is 
similar, and we omit the details here. ■ 

3.5 CLT for two-vertex characteristic: Proof of Theorem 2.7 

In this section, we use and extend the theory developed in the previous section to prove Theorem 
2.7. The plan is as follows: We shall start by proving the result for the CTBP in the summation 
instead of SWG^'^ and then argue that the difference is negligible. The result is formulated as 
follows: 

Theorem 3.7 (CLT for two-vertex characteristic for CTBP) The two-vertex characteris- 
tic satisfies that, as t —t- oo, 

£^P!",.)^,.BP;"i.,'^t^W ^ ^^^^^^^^ ^^ ^^^ 

Proof. Theorem 3.7 follows when we show that, writing at = (e"^/^*^''**, Q^'^Hs'^xt-^^ £qj, gQj^g 
vector Q = (ofi, a2)i 

' — ^ e"?/2+"i/^ (3.82) 

Indeed, define the (random) measure P on pairs (A, Y) by 

PiX <x,Y<y)= ^^Pr-(-)^.^BPf)(.)X.(t) _ ^^^^^^ 
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Then, (3.81) states that the pair {X, Y) converges in distribution to a pair of independent standard 
normal distributions, which, in turn, follows when, for each (01,02) € M, we have that 



(3.84) 



where E denotes the expectation w.r.t. P. 
In order to show (3.82), we show that 



,(1,2) 



(3.85) 



Together with Theorem 2.5, this then implies the result, as e~'^^^ z^^'^^ converges a.s. to a strictly 
positive random variable. Thus, we are left to prove (3.85). We shall show that the convergence 
in (3.85) in fact holds in L^. For this, it is immediate that it suffices to study 



Mt{d) = E z,(^'''(at)e-^"^+"^^^ - e-i/^+^i/^zt^-^) 
Qt{a) ^ \^v{zf''\atf), Ct{a) ^ Cov(z^^', zf'^' («*))■ 
In terms of these quantities, we can rewrite 

= Mtiptf + Qt(a)e-2("i+"2)v^ + e"?+"iQ,(0) - 2a(a)e-("i+"2)^e"?/2+°i/2^ 
Therefore, we shall prove that 



-2\t 



Mt(a) = 0(1), 



(3.86) 
(3.87) 

(3.88) 
(3.89) 



and 



e-^^*fQt(a)e-2('^i+"2)v^ + e"i+"2Q,(o) - 2C7t(a)e-("^+"2)^e"?/2+"i/2^ = ^(l). (3.90) 



For these proofs, the explicit computations of mean and covariances of z\ ' (St) and z 
z'l^'^\l, 1) are crucial. 

To prove (3.89), we rewrite 



,(1,2) 



-2At ; 



Since St = (e"i/v^, e"^/^^) 



(1,1), 



e-2^*E(zf •''(!)) ^ A, e-^('^i+'^2)*E(z,'^'')(a)) ^ A, 
where A is the limit in (3.78) in Lemma 3.6(a). By a second order Taylor expansion, 



{alXt - At - aiVXt) = Xt{e 



1 



) = a?/2 + o(l). 



(3.91) 
(3.92) 

(3.93) 



Together, these two asymptotics show that (3.89) holds. The proof of (3.90) is identical, now 
using Lemma 3.6(b) instead, and the fact that the limit equals Acov(l, 1) for all contributions, 
since at — )• (1, 1). ■ 
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Completion of the proof of Theorem 2. 7. We can bound 



(3.94) 



The random variable in the denominator converges in probabiUty to kW^^^W^"^^ > by Theorem 
2.5, so that it suffices to prove that the numerator converges to in probabihty. 

Denote 

ieswG^^' ieswGf 

Then, similarly to Proposition 2.2(c), and recalling that tn = T12 At* where = (2A)~^ logn + S 
for some > 0, we obtain that supj<;j^ (z"'^^'^^ — z^'"'^^') is tight. From Theorem 1.1 (whose 
proof has been completed since it relies only on Theorem 2.5, which was proved in the previous 
section), we know that the collision time Tyi is bounded by with probability 1 — o(l) as i? t 00. 
Therefore, 



e-^'* E E X.,(t)-e-2^* E X.,(t) = e-^^*(zr'^->-zr)^0. (3.96) 

jGBP[^' jeBpf ^ 

This completes the proof. 



A Appendix: auxiliary results 

In this section, we prove an auxiliary result on the asymptotics of the measure fiaidu) = 

Er=o«v*^(d^). 

Lemma A.l (Asymptotics of density of fla) (a) Let 

00 

g-Aa=« aj^l*j^du) = Pa{u)du. (A.l) 

i=i 

Then, for u G [0, 1], there exists a constant c such that Pa{u) < cu^^^~^ , while, for u > 1, pa{u) 
is bounded and as u ^ 00, 

Pi{u)^Xs. (A.2) 

(b) The following scaling identity holds: 

Pa{u) = a'p{ua'). (A.3) 

In particular, when — >■ 1, 

PaSu)^\s. (A.4) 

Proof. By (3.6), we have 



00 



^ r(j/s) 
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This form immediately proves the identity in (A. 3), and therefore also (A. 4) follows from (A. 2). 
Also, this form immediately shows that Pa{u) < cu^/^~^ for u € [0, 1]. Thus, we are left to prove 
(A.2). 

By [12, 8.327], we have that, as z — >• oo, 

z"-^/2g-.^ < Y{z) < z^^i/2e""^/2^(l + — ). (A.6) 

12z 

Therefore, 

oo -. 

= (1 + o(l))e-^" V u^/"^X^/'e'/'{j/s)-^/' (A.7) 

oo r-7 

= (1 + o(l))Ae-^«(An)-i ^ ^(Xusej-y/^, 

where the error term converges to as z — )• oo. Note that the r.h.s. is a function of Xu, so that 
it suffices to prove that 

oo ^ 

q(v) = e-^V-^y/^y^ _^^^/s\og{vse/j) _^ ^_ .^_g) 

i=o ^ ■' 

For this, we note that j i— >• is maximal when j = sv, where it takes the value e''. A 

second order Taylor expansion shows that when j — sv = x, we have 

Performing the approximate Gaussian sum leads to the claim in (A.8). ■ 
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